A guide to evaluating linkage quality for the analysis of linked data

نویسندگان

  • Katie L Harron
  • James C Doidge
  • Hannah E Knight
  • Ruth E Gilbert
  • Harvey Goldstein
  • David A Cromwell
  • Jan H van der Meulen
چکیده

Linked datasets are an important resource for epidemiological and clinical studies, but linkage error can lead to biased results. For data security reasons, linkage of personal identifiers is often performed by a third party, making it difficult for researchers to assess the quality of the linked dataset in the context of specific research questions. This is compounded by a lack of guidance on how to determine the potential impact of linkage error. We describe how linkage quality can be evaluated and provide widely applicable guidance for both data providers and researchers. Using an illustrative example of a linked dataset of maternal and baby hospital records, we demonstrate three approaches for evaluating linkage quality: applying the linkage algorithm to a subset of gold standard data to quantify linkage error; comparing characteristics of linked and unlinked data to identify potential sources of bias; and evaluating the sensitivity of results to changes in the linkage procedure. These approaches can inform our understanding of the potential impact of linkage error and provide an opportunity to select the most appropriate linkage procedure for a specific analysis. Evaluating linkage quality in this way will improve the quality and transparency of epidemiological and clinical research using linked data.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Evaluating the effects of guide plans on the mental health of villagers (Case study: Central district of Darab County)

Various developmental proceedings of conducting guide plans have been considered as one of the main prerequisites for social welfare by providing a basis for comprehensive development and improvement of living conditions in the rural areas and promoting the mental health of villagers. Accordingly, the present research seeks to evaluate the effects of guide plans on the mental health status of v...

متن کامل

Identification of Linked Markers for Delayed Fruit Ripening in Tomato Using Simple Sequence Repeat (SSR) Markers

Tomato (Solanum lycopersicum L.) is an important vegetable crop and acts as model plant for fruit development studies. Besides that, post-harvest damage is a devastating phenomenon often associated with ripening process in tomato which in turn leads to greater yield loss. Understanding the genetics, molecular and biochemical pathways is the key to overcome the existing situation. In th...

متن کامل

High Frequency of IVS10nt546 Linked to VNTR8 in Iranian PKU Patients from Fars Province

Dear Editor Analysis of the phenylalanine hydroxylase (PAH McKusick 261600) gene in different populations has revealed more than 320 different mutations associated with phenylketonuria (PKU). One of these mutations, IVS10nt546, results in severe PAH deficiency due to defective mRNA splicing. It accounts for about 40 percent of all mutant alleles in Turkish and between 10 to 20 percent of all mu...

متن کامل

Genetic Heterogeneity of PKD1 and PKD2 Genes in Iran and Determination of the Genotype/Phenotype Correlations in Several Families with Autosomal Dominant Polycystic Kidney Disease

Autosomal dominant polycystic kidney disease (ADPKD) is the most common genetic nephropathy, which is characterized by replacement of renal parenchyma with multiple cysts. In Iran, the disease prevalence within the chronic hemodialysis patient population is approximately 8-10%. So far, three genetic loci have been identified to be responsible for ADPKD. Little information is available concernin...

متن کامل

Structural-Functional Analysis of Rural Guide Plans in Improving the Quality of Villagers’ vitality in Panj Hezareh Rural Area located in central part of Behshahr County

Promoting and improving the quality of villagers’ vitality is looked upon as the main objective of implementing rural development programs and from which the most important ones, the guide plans, are considered as the most relevant local development plans. In this regard, it has been of vital importance to implement the necessary evaluation after several years of conducting the guide plans. Thi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره 46  شماره 

صفحات  -

تاریخ انتشار 2017